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Abstract 

A new achievability scheme for the compound channel with discrete memoryless (DM) state noncausally available 
at the encoder is established. Achievability is proved using superposition coding, Marton coding, joint typicality 
encoding, and indirect decoding. The scheme is shown to achieve strictly higher rate than the straightforward extension 
of the Gelfand-Pinsker coding scheme for a single DMC with DM state, and is optimal for some classes of channels. 



I. Introduction 

Consider the problem of reliable communication over a compound channel with discrete memoryless (DM) state, 
where a sender wishes to communicate a message to a receiver with the state sequence available noncausually at the 
encoder For simplicity we consider the case when the compound channel comprises only two discrete memoryless 
channels (DMCs) with DM state. This setup is essentially the same as sending a common message over a 2- 
receiver discrete memoryless broadcast channel (DM-BC) with DM state when the state in available noncausally at 
the encoder as shown in Figure [T] As such, we focus our discussion throughout the paper on this equivalent setup. 
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Fig. 1: Sending common message over DM-BC with DM state. 



The capacity for the single receiver case, widely referred to as the Gelfand-Pinsker channel, was established 
in m as 

Cgp- ^ max (/([/: F) - /(C/; 5)). 

The proof of achievability involves randomly generating a subcodebook for each message. To send a message, the 
sender finds a codeword in the message subcodebook that is jointly typical with the given state sequence. The 
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receiver decodes the codeword and hence finds the message. The details of the proof can be found, for example, 
in ||2l Lecture 7]. 

A straightforward extension of this Gelfand-Pinsker scheme to the DM-BC with DM state yields the lower bound 
on capacity 

C> max min{/(;7;Yi) -/([/;5),/([/;r2) -/(t/;^)}- (1) 

p{u\s)^ x[u.s) 

In O, it is conjectured that this rate is optimal in general. We show that this is not the case. We devise a new 
coding scheme for this channel that involves superposition coding, Marton coding, joint typicality encoding, and 
indirect decoding 1|4J. Our scheme yields the following lower bound on capacity. 

Theorem 1: The common message capacity of the DM-BC with state information available non-causally at the 
sender is lower bounded by 

C > maxmin{/(M^, U; Yi) - I{W, U; S), I{W, V; Y2) - I{W, V; S), 

i {I(W, U; Yi) - I{W, U- S) + I{W, V; Y2) - I{W, V; S) - I{U; V\W, S))}, 

where the maximization is over distributions p{w, u, v\s) and functions x{w, u, v, s). 

It is easy to see that this lower bound is at least as large as [T] We simply set [/ = y = 0. We will show that 
our lower bound can in fact be strictly larger than [T] 

In the following section, we formally define the problem of sending a common message over a DM-BC with DM 
state and describe the new coding scheme. In section |III1 we show through an example that the new lower bound 
can be strictly larger than the straightforward extension of the Gelfand-Pinsker result. In section IIVI we present 
several classes of channels for which the new rate is optimum, including a class of compound Gaussian channels 
where the new rate achieves the dirty paper coding rate Q for both channels simultaneously. 

The notation used in this paper will follow that of El Gamal-Kim Lecture Notes on Network Information 
Theory ^ Lecture 1]. 

II. ACHIEVABILITY SCHEME 

Consider a 2-receiver DM-BC with DM state {X,S,{p{yi,y2\x,s)p{s),yi,y2) consisting of a finite input 
alphabet X, finite output alphabets , 3^2, a finite state alphabet S, two a collection of conditional pmfs p{yi ,y2\x, s) 
on 3^1 X y2, and a pmf p{s) on the state alphabet S. 

A (2"^, ?i) code for the DM-BC with noncausal state information available at the encoder consists of: (i) a 
message set [1 : 2"^], (ii) an encoder that assigns a codeword x"{m, s") to each message m and state sequence s", 
and (iii) two decoders, decoder 1 assigns an estimate S [1 : 2"^] or an error message c to each received 

sequence y" and decoder 2 that assigns an estimate 7712(2/2) S [1 : 2"^] or an error message c to each received 
sequence 1/2 ■ We assume that M is uniformly distributed over [1 : 2"^]. The probability of error is defined as 
pi") = P{Mi ^ M or M2 ^ M}. 

A rate R is said to be achievable if there exists a sequence of (2"-R,7z) codes with Pi'"^ as 71 00. The 
capacity C is the supremum of all achievable rates. 

The main result in this paper is the lower bound on the common message capacity of the DM-BC with DM state 
available non-causally at the encoder in Theorem [T] The proof of this theorem follows. 

Codebook generation 

• For each m, generate 2"-^" w'"{m,lo) sequences according to YYi=iPw{wi). 

• For each {ni,lo) pair, generate 2"-^^ u"'{m,lo,h) sequences according to YVi=iPu\w{ui\wi). 

• For each (m,/o) pair, generate 2"^^ v'^{m,loj2) sequences according to nr=i Pv|w(^^l'"^^)• 
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Encoding 

The encoding procedure is illustrated in Figure |2] 

• Given message m and state sequence s", the encoder finds lo e [I : 2"^«] such that (w"(?o), s") G 7^' 
there is more than one Zq, it chooses the smallest one. If there is none, it chooses /q ~ 1- 

• The encoder next finds ?i £ [1 : 2"^i] and ?2 e [1 : 2"^^] such that 

(^"(m, /q), s", u"(m, /q, Zi), u"(m, /q, ^2)) G Til'"''. If there is more than one such pair, it chooses the pair 
with the smallest indices, first in ^i, then in l2- If there is none, it chooses (1, 1). 

• The encoder transmits x{wi,Ui,Vi, Si) for i G [1 : n]. 

Note that this scheme is essentially Marton coding with only diagonal product bins. Interestingly, the same encoding 
scheme can be used if we wish to send a common message Mq to both receivers and private messages Mi to Yi 
and A/2 to ¥2- 
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Fig. 2: Achievability scheme. 



Decoding 

Let e' > e > 0. 

• Decoder 1 finds m indirectly by decoding (to, Iq). It declares that rhi is sent if it is the unique message such 
that (w"(TOi,/o),u"(TOi,fo,fi),y'iO G 7^1"^ for some Iq G [1 : 2"^"], h G [1 : 2"'^i]. 

• Decoder 2 finds to indirectly by decoding (to, l^). It declares that TO2 is sent if it is the unique message such 
that («;"(TO2,/o),w"("i2,/o,?2),y2) G TS"^ for some [q G [1 : 2"^«], ^2 G [1 : 2"^^]. 

Analysis of probability of error 

An error may occur if either the encoder does not find a quadruple such that {w^{m, Iq), s^\u'^{m, Iq, li), 
v"(rn, Iq, I2)) G T}'^\ or there is an error made by decoder 1 or 2. 
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We now analyze the probability of error averaged over codebooks. Without loss of generality, assume M = 1 is 
sent and (Lq, ii, L2) are the corresponding indices. Define the encoding error events 

£m = {{S^,W^{IM)) i 7;'"^ for all ^o}, 

^02 = {(5", W^{l,Lo), [/"(I, io, ^i), io, ^2)) i for all hM} 

Then the total encoding error probability is 

By the covering lemma 12] Lecture 3], the first term P(foi) ^ as n ^ 00 if 

To > I{W;S). 
Next, consider the second probability of error term 

P(fo2nfo^) =P{(5",W^"(l,Lo),C/"(l,i^o,^i),^"(l,io,^2)) ^7;(") for all h,h} 

< P{M^"(l,Lo)=«;",5" = .s"}P{fo2(s",u;")}, 

(u)",s")eri"'(iv,s) 

where £m{s"'.w'^) denotes the event that {(S*" = s",W^"(l,?o) = w", [/"(I, Ji), io, ^2)) i T^*"^} for 

all 1 1 and Z2, conditioned on the fact that the pair 

We show in Appendix lAl that P(£o2(s", w")) ^ as n — > 00 if 

T^> I{U-S\W) + 5{e), 
T2 > I{V;S\W) + 5ie), 
Ti + Ts > /([/; S\W) + liV; S\W) + I{U- V\W, S) + (5(e). 

Next consider the probability of decoding error. Consider the following error events for decoder 1 

= {(5", W-il,Lo), C/"(l, Lo, L,),Yn i rj"^}, 
£x2 = {(5", [q), C/"(m, [q, ^i), I'l") € rj"^ for some [q e [1 : 2"^°], [1 e [1 : m ^ 1}. 

The probability of error restricted to £^^i for decoder 1 is upper bounded as 

P(^i) <P(fiin£o'i) + p(£i2). 

By the law of large numbers, the second term P(£ii n £01) — as n — s- cx). By the packing lemma 12] Lecture 
3], the third term P(fi2) — !■ as ?i — !■ cxd if 

R + Ta + Ti<I{W,U-Yi)~5{e). 

Similarly, the probability of error at decoder 2 tends to zero as n — !> cxd if 

i? + To + < I{W, V- Fa) - (5(e). 

Thus the overall probability of error tends to zero as n — > cxd if 

i? + ro + Ti </(w^,t/;ri), 

i? + To +T2 </(!¥, F;y2), 
To >/(!¥; 5), 
Ti >/([/; 5|VF), 

Ti+T2> I{U] S\W) + KV; S\W) + I{U; V\W, S). 

Performing Fourier-Motzkin Elimination on the stated rate constraints then gives the achievable rate stated in 
Theorem [U □ 
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Remarks: 

1) It suffices to set X as a deterministic function of W and S' in ([T]) and in Theorem 1. In ([T]l, if X is a 
probabilistic mapping of (VF, S), by the functional representation lemma JJ] it can always be expressed as a 
function of {W, S, Q), where Q is independent of (W, S). Defining W = {W, Q), we obtain X = x{W' , S), 
I{W';Yi) - I{W';S) > I{W;Yi) ~ I{W; S) and I{W';Y2) - I{W'- S) > I{W;Y2) - I{W; S). Similar 
reasoning can also be applied to Theorem 1. 

2) Theorem 1 can be readily extended to any finite number of receivers (equivalently, compound channel 
comprising a finite number of DMCs with DM state). In this case we have the common auxiliary random 
variable W and as many individual auxiliary random variables as the number of receivers. 

III. Example 

We now show through the example in Figure [3] that the achievable rate in Theorem [T] can be strictly larger 
than the rate achievable by the straightforward extension of the Gelfand-Pinsker coding scheme to the 2-receivers 
DM-BC with state given in [H which we denote by Rgp ■ 

We have \X\ = = |3^2| = \S\ = 2 and P{S = 0} = 1/2. The top half of the example corresponds to the 
channel transition probabilities when S = while the bottom half corresponds to the channel transition probabilities 
when S = 1. 



5 = 



5=1 




Fig. 3: Example DM-BC with DM state. 

From Theorem[T] we set 1^ = 0, [/ = Yi, V = Y2 and P{X = 0\S = 0} = P{X = 0\S = 1} = 0.5. It is easy 
to verify that this choice of auxiliary random variables gives us an achievable rate of i? = 0.5. It is also clear that 
C < I{X; Y\S) = H{Y\S) = 0.5. Therefore, Theorem [T] achieves the common message capacity for this example. 

A. Rgp < C 

Expanding I{U; Yi) - /([/; S) in[T] we obtain 

/([/; Yi) - /(C/; S) = I{U; Y,,S)- I{U- S) - I{U- S\Y^) 
= H{Yi\S) - H{Yi\U, S) - I{U;S\Yi) 

<H{Y,\S)<^. 

To achieve Rqp = H{Yi\S), we require that U — 5- Yi ^ 5 form a Markov chain and Yi a function of {U,S). 
Since Yi = X when 5 = 0, we require that X is a function of U when 5 = 0. Similarly, from I{U; Y2) — I{U; S), 
we require U — > Y2 — !> 5 and Y2 a function of {U, S). This implies that X is a function of (f/, S). To further 
achieve Rqp = 0.5, we require that P{X = 0\S = 0} = P{X = 0\S = 1} = 0.5. 
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Let 
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= 0,5 = 
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Since X is a function of {U, S), at least one of the two parameters and bi is equal to zero and at least one 
of Ci and is also equal to zero. Further, from the Markov chain conditions P{U = i\Y2 = 0,5 = 0} = P{U = 

i\Y2 = 0, 5 = 1} and P{U = i\Yi = 1,5 = 0} = P{U = i\Yi = 1,5= 1}, we obtain 

Qj + h _ 
Ci + di 



If fli = 0, hi = 2ci and = Sq. Since one of Ci,di = 0, this means that = 6^ = q = = or P{U = i} = 0, 
which is a contradiction. Similarly, 6^ = forces P{U = «} = 0, which is again a contradiction. This shows that 
there is no U with the required properties. Hence, i?GP < C. 

In fact, by means of a symmetrization argument given in Appendix |B] we can show that i?GP can be computed 
exactly and is approximately equal to 0.41, implying a gap of 0.09 from C. 

IV. Special Classes of Channels 
Theorem [l] achieves the common message capacity in the following cases. 

A. A class of deterministic channels with state 

If both Yi and Y2 are functions of (X, 5) and /(Yi; ^2 15) = 0, then 

C= maxmin{if(ri|5),i7(y2|5)}. 

p{x\s) 

The example given in Section |III]belongs to this class of channels. Achievability follows from Theorem[T|by setting 
= 0, [/ = Yi and F = Y2. The converse follows from the fact that C < maxp(^|,) min{/(X; ri|5), /(X; ^215)}. 
Remark 1: One can also generalize this result to the class where Yj and Y2 are functions of {X, 5); Yi and 
Y2 share common information (in the sense of Gacs-Korner), i.e. there exists Z = f{Yi) = .g(Y2), and further 
/(Yi; Y2I5, Z) = 0. The achievability follows from Theorem [U by setting W = Z, U = Yi and V = Y2. 



B. A class of compound Gaussian channels 

We now develop a Gaussian analog of the example in Section |IV] Let 5 
Zs ^ N{0, Qt)- The channel is defined as follows. When T = 0, we have 

Yi = .giX + Zs + Zi, 



(T, Zs) where T ~ Bcrn(a) and 



>2=0, 



where Zi ~ iV(0, 1). When T = 1, we have 



Yi 
Y2 



0, 

.92^ 



Zo 



where Z2 ^ A^(0,1). The random variables [T, Zs), Zi, Z2 are mutually independent. Since Zs ^ N{0,Qt), 
we may have different variances in different states. Further, we assume an average transmit power constraint: 
Er=i E(x2(m, 5")) < nP, m G [1 : 2"-"]. 

An upper bound on the capacity of this channel is 



C < 



p{x\ 



max min{/(X; Yi|5),/(X; Y2I5)}. 

):E(X2)<P 
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It is easy to show that I{X;Yi\S) < aC(.g?Pi) and I{X;Y2\S) < aC(g|P2), where aPi + aP2 ^ P and 
C(P') = (l/2)log(l + P'). From the writing on dirty paper resuh [O, in the single state case, the rate is C(P). 
Can we achieve the dirty paper coding rate for both Yi and Y2 simultaneously for this more comphcated class of 
compound Gaussian channels? 

Using Theorem [1] we set W ~ T. When T = 0, we set 

U = Xo + -^^^Zs, and V^T, 
1 + giPi 

where Xi - N{0,Pi). When T = 1, we set 

[/ = T, and y = + Zs, 

1 + 32^2 

where Xi ^ iV(0, P2) and aPi + aP2 = P. This choice of random variables gives us the following achievable rate 

R < I{T- Fi) + /([/; Yi|T) - /([/; Zs\T) - H{T), 
R < I(T; Y2) + I{V- Y2\T) - I{V; Zs\T) - H{T), 
2R < I{T; Yi) + I{U; Yi \T) - I(U; Zs\T) - H{T)+ 

I{T; Y2) + I{V; Y2\T) - I{V; Zs\T) - H{T) + I{U- V\T, Zs). 

Since /(T; Yi) = /(T; Y2) = H{T) and I{U; V\T, Zs) = 0, simphfying the expression gives us 

R< max Ta:v[i{aC{glPi),aC{glP2)}, 

aP\+aP2—P 

which shows that we can achieve the dirty paper coding rate for both channels simultaneously. 

V. Conclusion 

We established a new achievable rate for the compound channel with DM state available noncausally at the 
encoder The new achievable rate is shown to be strictly larger than the straightforward extension of the Gelfand- 
Pinsker coding scheme for a single state case. This result also implies that the straightforward extension of the 
Gelfand-Pinsker coding scheme for transmission over a DM-BC with DM state is not optimum. 
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Appendix A 
Bounding P(fo2(s",-;«")) 

The technique we use for bounding the term P(fo2(s", w"^)) is similar to that in the proof of the mutual covering 
lemma in [2] Lecture 9]. 

P(fo2(s", u;")) is given by the probability of the event: {s", u;", [/"(^i), ^"([2)) i T^'^} for all [1 G [1 : 2"^i] 
and ^2 G [1 : 2"^^]; where f/"(/i) and V^"(/2) are independently generated, conditioned on the given w" , according 
t° YVi=iPu\w{K-t\wi) and YVi=iPv\w{'"i\wi) respectively. Note that we are given (.s",w") G 7^'^"''. 
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To show that P{£a2) as n ^ oo, M A = {{h.h) ■ (s", w", C/"(/~i), t/"([2)) e T;^"-*} and I{h,l2) = 1 if 
(s", w", C/"(Zi), V^ih)) e 7;^"^ and otherwise. Then, |^| = J^i, u ^ihM) and the expected number of jointly 
typical sequences is given by 

E 1^1 - Y. 6 

We further have the following bound on the probability: 

^ p(u")p{(s^u;^t/"(^l),T/"(/•2)) i;7"(z'i) = it"} 

n 

i"er,'"'(C/|tu",s") ' = 1 

M"er/"'(c/|iu",s") 
_ 2-"(-''(t^;'S'|M')+-f(s';^|M')+-f(c^;^|M'.s')) 

Hence, we have 
Next, let 

=P{(,s",«;",J7"(l),l/"(l)) £?;(")}, 
P2 = P{(.s",«;",[/"(1),1/"(1)) e (s",u;",C/"(l),l/"(2)) e T;^")} 

E M"")P{(s",ti'","",^^"(i)) er/")}p{(s",u;",u",F"(2)) 

i'i"Gri"'(;7|ju",s") 

2-"(-f('^;'S|M')+2-f(V;S|H')+27(;7;y|H',S)-<5(e)) 

P3 = P{(s",?«",!7"(l),l/"(1)) e (s",u;",C/"(2),T/"(l)) e T.^")} 

^ p(C")P{(s",u;",i)",i7"(l)) G7;("'}P{(s",u;",S",t/"(2)) er,(")} 

S"Gri"'(C/|to",s") 
^ 2-"(^(^:'S|^)+2-f('^;S'|M')+2^('^;V|H',S)-<5(e)) 

P4 = P{(s",w",J7"(l),F"(l)) G (s",u;",c7"(2),F"(2)) G 7;^"^} 

Note that E |^| = 2"('^i+'^^)pi. 

E i^p = 2«(-^+-^)p, + E + E E ^3 + E E E 

Hence, 

Var(l^l) < 2"(^i+2^^)p2 + 2"(2Ti+T.)p^ _^ 2"(^i+^^V. 

By Chebychev's inequality, we have 

P{|^| = 0} < P{(|^| - E 1^1)2 > (E 1^1)2} 
^ Var(l^l) 
- {E\A\y 

2-"iTi-I{UiS\W)-5{€)) _|_ 2-n(T2-I{V:S\W)-5{e)} 
_|_ 2-"(Ti+T2-7(C/;S|W)-/(y;S|W)-7(C/;ylW,S)-(5(e)) 
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Hence, P{|^| = 0} ^ as n ^ oo if the following conditions are satisfied 

Ti > I{U;S\W) + 5{e) 
T2 > I{V;S\W)+S{e) 
Ti+T2> I(U; S\W) + KV; S\W) + I{U; V\W, S) + (5(e). 

Hence P(£o2(s", w")) goes to as n —5- 00, provided the above conditions are satisfied. 

Appendix B 
Exact evaluation of Rqp 

In this apendix, we evaluate Rqp using a symmetrization argument. Consider any {U,S,X) defined by P{U = 
= 0} = u,,F{U = i,S =1} = Vi,F{X ^ 0\U ^ i,S = 0} ^ a^,P{X ^ 0\U = i, S = 1} = 1 - h. From 
the fact that it suffices to look at X = f{U, S), we have a^, bi £ {0, 1}. 
Then the following holds 

H{Y) = H{j:^u,a.), H{Y\U) = J2^(u, + v,)H 

HiS) = 1, HiS\U) = J2,iu, + u,)Hi^^ 

H{Z) = HiY^^uM). H{Z\U) = + 

Now define a {U',S,X') ([/' of size 2\U\) according to: 

V{U' = (i, 1), ^ = 0} = ?i,/2, V{U' = (i, 2), 5 = 0} = v,/2, 
V{X' = 0|[/' - (i, 1), S = Q}= a„ P{X' = 0|C/' - (^, 2), 5 = 0} = h, 
V{U' = (i, 1), 5=1} = i;,/2, ?{[/' = (i, 2), 5=1} = w,/2, 
V{X' = 0|C/' = (z:, 1), 5 = 1} = 1 - b^,¥{X' = 0|C/' = (z, 2), 5 = 1} = 

Then observe that the new entropies are 



+ ^1 >\{H{Y) + H{Z)), 



His) = 1, 

HiS\U') = y^iu, + v,)H (^^] = HiS\U) 



H{Z') = /.(^ + > LiHiY) + HiZ)), 

i 

H{Z'\U') = E + «0 f f-^) + H (-^)) = \{HiY\U) + HiZ\U)). 



Thus, I{\J'; Y') - I{U'; S) = I{U'; Z') - I{U'; 5) > i {I{U; Y) - /([/; 5) + /([/; Z) - /([/; 5)) . 

A. Maximization ofI{U'; Y') - I{U'; 5) 

Our maximization problem reduces to maximizing 

I{U';Y')~IiU';S) 
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over all pmfs with the stated U' structure. That is, we wish to maximize 



£1 



Ui + Vi 

subject to Ui = 0.5, Vi = 0.5, a^, 6,; G {0, 1}. The term can be rewritten as 



u, + v,)[H\ — -H 



Ui +Vi J \Ui+ V, 

Let I be the set of indices where — and J be the set of indices where bi ~ 0. This implies that on we 
have ai = 1 and on J"^ we have hi ~ \. 
Thus, we wish to maximize 



Ui 



Ui + Vi 



I V Ui + Vi 



subject to Y,i = 0-5, Z^i = O-^- 
Define the following: 



Xi 

2 


= E 


yi _ 

2 


^ E 


X2 

2 


= E 


2 


= E 


2 


- E 


2 


= E 


Y 


= E 


2/4 

2 


= E 



Observe that = 1, yi = 1. 

We note the following as a consequence of the concavity of the entropy function. 



.^^2 \Ui + ViJ V + 



^ xi+yi ^ / xi \ ^ X2+y2 ^ f X2 \ ^ X3 + _^ f X3 



2 \xi+yij 4 \X2 + y2j 4 \X3 + y3 

Therefore we can upper bound the true maximum by the maximum of 

^ / X3+X4 _^ y2+y4 \ _^ Xi+yi ^ / Xi \ ^ X2_+y2jj f X2 \ ^ X3 + 2/3 ^ / X3 _ ^ 

V 4 4 y 2 U-i+2/1/ 4 ^2 + 2/2/ 4 Us + 2/3. 

subject to = Si Vi = 1 and x^,yi> 0. 
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Now, we relax this maximization to xi + yi = 2 and Xi,yi > 0. 

Define the partial sums si = xi +yi, S2 ~ X2+y2, S3 = 2:3 + j/3, and S4 = 2:4 + 2/4. We re-write the maximization 

as 



4 4 / 2 V'"^!/ 4 V'52/ 4 

subject to < xi < si, < y2 < S2, < ^3 < S3 and J^i = 2- 

Using concavity of entropy, we can bound the maximum of the above expression by the maximum of 



4 4 J 2 \si J 4 V52 + S3 

subject to < xi < si, < y2 + 2^3 < S2 + S3 and = 2- 

We first maximize with respect to xi and y2 + x^ keeping the Si terms fixed. Observe that the maximization is 
separable and it is concave in xi and y2 + X3. Hence the maximum occurs when the first derivatives are zero; i.e. 
a;i - — ana — + — ^ - i - ^^qi^j. 

The second condition implies that 

, ^ ,,1 , 1 ^ , S4 ^ (4-S4)(s2 + S3) 

2/2 + 2:3 - + — - — = 1 - -r> or 2/2 + 3:3 = — — . 

4 S2 + S3 4 4 + S2 + S3 

Substituting for the optimal choices of xi, 2/2 +2:3, the maximization reduces to that of 

2 4 4 + S2 + S3 

subject to J2i Sj — 2, and Si > 0. 

Denote S2 + S3 = t and rewrite the maximization as 



AJ V 4 + t / 2 2 
subject to < t, < S4, S4 + t < 2. 
We divide into four cases: 

1) The maximum is achieved at some strictly internal point, i.e. no inequality is tight. 

2) The maximum is achieved when t = 0. 

3) The maximum is achieved when S4 = 0. 

4) The maximum is achieved when < + S4 = 2 but neither t or 54 is zero. 

It is not difficult to verify that the maximum over all four cases is attained by Case 3, with the setting t = |, S4 = 0, 

2 

3' 



and si = |. The maximum value is approximately 0.41. 



B. The maximizing p{u, s), s) 

We now show all the relaxations can be made tight, i.e. there exists a suitable choice of U that achieves the 
derived bound. 

Consider the following \U\ with cardinality 3 defined according to; 



F{U = 


1,5 = 


0} = 


-,P{X = 
6' ^ 


o\u = 


1,5 = 


0} = 


0, 


P{U = 


1,5 = 


1} = 


-,P{X = 
6' ^ 


1\U = 


1,5 = 


1} = 


0, 


P{U = 


2,5 = 


0} = 


12' ^ 


= o\u -- 


= 2,5 = 


= 0} = 


= 0, 


P{[/ = 


2,5 = 


1} = 


l,P{x = 


1\U = 


2,5 = 


1} = 


1, 


P{U = 


3,5 = 


0} = 


\,P{X = 


o\u = 


3,5 = 


0} = 


1, 


P{U = 


3,5 = 


1} = 


12' ^ 


= l\U- 


= 3,5 = 


= 1} = 


= 0. 
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For this channel observe that 



I{U;Y)-I{U;S)^I{U;Z)-I{U; 
w 0.41. 
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